Evaluating Discourse Processing Algorithms
نویسنده
چکیده
In order to take steps towards establishing a methodology for evaluating Natural Language systems, we conducted a case study. We attempt to evaluate two different approaches to anaphoric processing in discourse by comparing the accuracy and coverage of two published algorithms for finding the co-specifiers of pronouns in naturally occurring texts and dialogues. We present the quantitative results of handsimulating these algorithms, but this analysis naturally gives rise to both a qualitative evaluation and recommendations for performing such evaluations in general. We illustrate the general difficulties encountered with quantitative evaluation. These are problems with: (a) allowing for underlying assumptions, (b) determining how to handle underspecifications, and (c) evaluating the contribution of false positives and error chaining.
منابع مشابه
EVALUATING DISCOURSE PROCESSING ALGORITHMS (Appeared in ACL89, Vancouver)
In order to take steps towards establishing a methodology for evaluating Natural Language systems, we conducted a case study. We attempt to evaluate two di erent approaches to anaphoric processing in discourse by comparing the accuracy and coverage of two published algorithms for nding the co-speci ers of pronouns in naturally occurring texts and dialogues. We present the quantitative results o...
متن کاملCustomizing And Evaluating A Multilingual Discourse Module
In this papeh we first describe how we have customized our data-driven multilingu~fl discourse module within our text understanding system lor dill'erent lm~guages and for a particular NLP application by utilizing hierm'chic~dly organized discourse KB's. Then, we report qum~titalive and qmditative findings from ewduating the system both with and without discourse processing, ~md discuss how res...
متن کاملEfficient Processing of Underspecified Discourse Representations
Underspecification-based algorithms for processing partially disambiguated discourse structure must cope with extremely high numbers of readings. Based on previous work on dominance graphs and weighted tree grammars, we provide the first possibility for computing an underspecified discourse description and a best discourse representation efficiently enough to process even the longest discourses...
متن کاملCan Discourse Relations be Identified Incrementally?
Humans process language word by word and construct partial linguistic structures on the fly before the end of the sentence is perceived. Inspired by this cognitive ability, incremental algorithms for natural language processing tasks have been proposed and demonstrated promising performance. For discourse relation (DR) parsing, however, it is not yet clear to what extent humans can recognize DR...
متن کاملProsodic and Lexical Correlates of Swedish Discourse Markers in Spontaneous Dialogue
Discourse markers are words or phrases that speakers use at the beginning of a contribution to signal how it relates to prior discourse. They mark changes in the global discourse structure by e.g. signalling the beginning of a new topic or the return to a previous topic. However, words that are used as discourse markers often also have a sentential function. If discourse markers are to be used ...
متن کامل